-
Notifications
You must be signed in to change notification settings - Fork 126
Add QOI compression #670
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Add QOI compression #670
Conversation
Custom codecs not supported by the Microsoft RDP implementation have rarely been used, but since we're building both the client and server in IronRDP, we're free to experiment! QOI is an excellent choice, it is very simple, and easier to optimize due to its low complexity. IIRC QOI uses RGB, not YUV like most codecs. Color conversion and chroma subsampling alone can take up a lot of the processing time if not properly optimized. There is just one custom codec extension I've seen in the past: JPEG in RDP. XRDP supports it in the server, and FreeRDP supports it in the client as an optional build feature. JPEG has many advantages: it is by far the most widely available image codec, but it is also one for which the most effort has been put into optimizing the implementation (think libjpeg-turbo). For an RDP web client, this also makes it possible to leverage built-in browser support for JPEG decoding, which is a huge advantage over porting and optimizing a custom codec in WASM. In the past, we've designed our own remote desktop protocol (Wayk Now) at Devolutions, and we experimented with many different ways to deal with the codecs more efficiently than RDP. We had GFWX ported in Rust: https://github.com/Devolutions/gfwx-rs GFWX, unlike QOI, is a simplified wavelet codec, which would make it closer to RemoteFX. It is generic and can accept a wide variety of YUV color formats (RGB wouldn't perform well, you're better off doing the YUV conversion to avoid the color channel correlation issue of RGB). It is one of the rare codecs that would enable really nice things like lossless encoding in which you could trim part of the encoded output for lossy encoding. In our implementation, we've used a reversible color conversion (YCoCg-R) due to its reduced complexity (it's only integer operations, not floating point). Unfortunately, we found out JPEG still performed better than GFWX. In theory GFWX offers ways to beat JPEG, but libjpeg-turbo is just hard to beat. If you're curious about the color transformations we've experimented with, we still have them here: I'm open to experimenting with various custom codecs, the ones in RDP are good, but not necessarily special or great. We can definitely beat the standard protocol if we go custom :) |
@awakecoding I added some jpegturbo benchmarks above. For lossless, desktop usage, QOI is very good - jpeg is far behind. Performance with wasm should be ok (rust-qoi compiles for wasm target) We used to have some heuristics in Spice server to detect video zones, and used jpeg aggressively iirc (when not using whole frame gpu video encoding). |
I’m impressed by the characteristics of this codec, especially given how simple the compression format is. As a custom codec, it seems to be a solid alternative to JPEG:
And the Rust implementation is not even optimized yet. For the reasons explained by Marc-André, I don’t think we can beat JPEG in the browser (yet?) even with WASM, but I suspect it’s better than the current implementation which does not rely on the native JPEG implementation of the browser anyway (RLE / RDP 6.0 compression codec → RGB → JavaScript Image object). Definitely does not hurt to use QOI. If we really want to squeeze performance using JPEG, we could consider that too in the future though. Obviously the limitation is that only IronRDP ↔ IronRDP will be supported, but I’m interested in seeing this as an option similar to how JPEG is an option for FreeRDP ↔ XRDP. By the way, do you have an idea how good it performs on the client side (decoding)? Good opportunity to get some some basic benchmarks merged too. |
78bd227
to
68571b4
Compare
@CBenoit I think we should be good to start reviewing this series. I can split various preliminary patches in different MR if you prefer? |
Sounds great! Yes, I would prefer multiple MRs. I’ll do my best to be as responsive as possible! |
Hi @elmarco |
@CBenoit yes, this has been pending for too long. I was waiting for qoi-rust maintainer. To unblock the situation, I decided to fork qoi and released it on crates.io. The qoi-rust maintainer now said he will resume his work, but we can already make progress on this PR without waiting for him. thanks |
Sounds great! I’m starting the review. First, this commit is not following the convention: benches:: fix could not find time in tokio |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
issue: The commit feat(session): add apply_rgb24() to apply non-inverted bitmap
does not add any feature by itself. It does not change any existing behavior, nor it exposes any new item. Looks like internal groundwork for one of the next commit. Should be refactor(session)
. Note that we are using the commits for auto-generating the changelog, and the idea here is that this commit should not be part of the changelog, or at least it should not appear as adding a new consumer-facing feature (in fact we exclude refactor commits from the changelog).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
issue: Invalid commit type for feature(server)!: replace with_remote_fx option with codecs
, use feat
suggestion: Could be a follow up PR, but I think that the crate ironrdp-cfg
may be a better place for client_codecs_capabilities
and server_codecs_capabilities
. It feels off to me to have CLI-related code in the protocol layer.
crates/ironrdp-web/Cargo.toml
Outdated
@@ -18,8 +18,9 @@ test = false | |||
crate-type = ["cdylib", "rlib"] | |||
|
|||
[features] | |||
default = ["panic_hook"] | |||
default = ["panic_hook", "qoi"] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thought: I’m not sure I want to enable qoi by default just yet 🤔
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, do you think we need a test period?
@@ -620,12 +638,14 @@ pub struct CodecId(u8); | |||
|
|||
pub const CODEC_ID_NONE: CodecId = CodecId(0); | |||
pub const CODEC_ID_REMOTEFX: CodecId = CodecId(3); | |||
pub const CODEC_ID_QOI: CodecId = CodecId(0x0A); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
question: Any rationale for choosing 0x0A? I assume it’s mostly arbitrary? Do you know about other similar extensions? (JPEG?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's arbitrary, this is just hard-coded values used by the client to reference a bitmap codec (the server doesn't care)
I’m done with the review. As always, excellent job! |
Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>
Add an optional "flip" argument for inverting bitmaps. Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>
Teach the server to support customizable codecs set. Use the same logic/parsing as the client codecs configuration. Replace "with_remote_fx" with "codecs". Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>
The Quite OK Image format ([1]) losslessly compresses images to a similar size of PNG, while offering 20x-50x faster encoding and 3x-4x faster decoding. Add a new QOI codec (UUID 4dae9af8-b399-4df6-b43a-662fd9c0f5d6) for SetSurface command. The PDU data contains the QOI header (14 bytes) + data "chunks" and the end marker (8 bytes). Some benchmarks showing interesting results (using ironrdp/perfenc) Bitmap: 74s user CPU, 92.5% compression RemoteFx (lossy): 201s user CPU, 96.72% compression QOI: 10s user CPU, 96.20% compression Note: the "qoicoubeh" crate is my own fork of "qoi-rust" project. The plan is to switch back to it as soon as the maintainer resume its activites (aldanor/qoi-rust#14). [1]: https://qoiformat.org/ Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>
Add a new QOIZ codec (UUID 229cc6dc-a860-4b52-b4d8-053a22b3892b) for SetSurface command. The PDU data contains the same data as the QOI codec, with zstd compression. Some benchmarks showing interesting results (using ironrdp/perfenc) QOI: 10s user CPU, 96.20% compression QOIZ: 11s user CPU, 99.76% compression Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com>
I recently found the QOI image format (https://qoiformat.org/) and was excited by its potential for desktop remoting.
I hacked up a few things to actually benchmark it compared to the other codecs.
Based on a raw recording of a typical Fedora desktop usage (terminal/web), for 730 4k frames:
None: 5s user CPU, 0% compression
Bitmap: 74s user CPU, 92.5% compression
RemoteFx (lossy): 201s user CPU, 96.72% compression
QOI: 10s user CPU, 96.20% compression
UPDATE:
jpeg-turbo 80% 4:2:2: 16s user CPU, 97.92% compression
jpeg-turbo 95% 4:4:4: 22s user CPU, 96.13% compression
jpeg-turbo lossless: 135s user CPU, 86.06% compression
UPDATE2:
tile diff+QOI+zstd stream long window: 11s user CPU, 99.76% compression!!
There is still a lot of various optimizations to be desired regardless of the codecs, but this will hopefully help. Also, we should start thinking how QOI could be adapted for streaming.